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The Leucine Zipper: A Hypothetical 
Structure Common to a New Class of 

DNA Binding Proteins 

^nuAML H. LiANpscHULz, Peter F. Johnson. Steven L. McKnigh 


A SO-amino-add wgmmt of C/BB£» a newiy dssoovcied 
enhancer Unding proisiii» shares notable Wfignrg sinii- 
lattajr with a segausat of the oeUnlar Myc transfimnhig 
pcotem. Displ^ of these xespectivc ammo add seqneDces 
on an idfaHwd a hcBx reffaJed a periodic rq^ettticm of 
kodne reddaes at every seventh position'over a ^«tan^ 
Goveroig e^jht helicd tunis. llie periodic arca^ 
four koones was also noted in nie sequences of the Eos 
and Juntraftsftirming proteins, as well as Aat of the yeast 
gene rcgnktoty pnyton, GGN4. The polyp^tidc 
ments containing diese peciodk arrays of leodne residues 
are lao^osed to exist in an a^elical conformation, and 
die Imcine side dsains giftgf^iy^g Scam (me a helix inter- 
digitate veith those di^layed fcom a similar a helix of a 
second polypeptide^ fedlhating dimcrfaation. This hypo- 
thetical structure is refer red to as the ^^oidne a rip p c r," 
and it may represent a diaracteristic p r op ert y of a new 
category of DnA binding protdns. 


THE MOlfiCOLAB. BASIS OB SPBCIBIC DnSSACnON BETWBSN 
prottans and DNA has been Ac sutjca erf intendvc study . 
Protdm that r^^date the vanous functions and mccaboHsm 
of DNA can be fixed at an apptc^ziate site of action by bonding 
avidly to ceitain DNA sequences. Proteins tiiat exhibit this property, 
tcnxicd sequenoc-spcdfic DNA binding protdns, play integral lofcs 
in DNA replicatxxi, reoHnbination, strand scission, and transciip* 
tion. Moreover, addidooal roles for sequencc-spcdfic DNA binding 
protdns will probably be discovered in processes rangii^ fixjm 
duomosoinc segregation during cyti^esis, to die cHxfcred fol^^ 
and unfolding of duomadn during successive cransitiofis becween 
interphase and mitosis. 

Tlve sequence-spedfic DNA binding proteins tliat have been 
studkd most extensively arc tiic gpnc activaior and rqxresscH: 
proteins of bacteria. Tlie acthv fiMins of dicse fwotcins arc 
ally symmetric dimcis, and their DNA binding sites are rotatk>oa% 
symmetric palindromes. X-ray ctystallographic studies have revealed 
a structural motif common co the DNA bindix^ dMnains of many 
bacterial repressors and activators (1). This DNA l»ndii^ motS^ 
termed *Ticlix-tum-heUx,* is chanHtcrized by two successive a 
hdices juxtaposed at a|^'roximatcly 9(f by z turn of four amino 
adds (2). Dimetizatkxi of this class of bactenal gene regulatory 
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protdns ananges <»ie a helix eadi monomer widi Its axnk)g so 
dut tiicy & into successive msyor grcMves of DNA. 

A second DNA binding motif^ cmuuon to many eokaryodc gene 
rcgDlatoty protdns^ has xecendy beba disocyvcccd The founding 
member of tius second fiunily is a metalloprotdDy temied TFmA,. 
wbktk acts as a positive reg giator of SS RNA gene expression (J). 
The amino add sequence of TPDIA has a Rpcatii^ motif consisting 
of two dosdy sjf^ced cysteines foUowod by two histuUnes (4). 
Biophysical stndies mdicate that a zinc ion is cetrahedtally erwdmatv 
ed by the cyseeine-hisddine mod^ and dut the oootdination oom- 
ffa t nyaits both stalMlity and DNA sequence-spedfidiy to the 
THHA polypepdde (5). Ihe amino add sequence (Oj^^ 
moti^ tamed, the "dnc finger,'* is bdng observed in an ever 
inaeasic^ number of eukaryotic gene iq;uklocy pzotems {6, 7). 

Two fiind a mmtal concepts have been established fiom studies on 
sequcQce-spedfic DKA bindii^ protdns. The first, which sads&s 
long-standing pKcdictions, holds that binding sped^^ 
direct atomic incetacdbn becween amino acid side ghaipf and base 
pairs m minimal^ distorted B DNA. There seems, however, to be 
no genexal code that ^>edfics die amino acid sur^ necessary to 

interact stably widi a ^vcn DNA sut&ce. In contrast, a second and 
less intuhivdy obvious omoqpt has emerged Thar is, a Unnted 
number of stmoucal niod& ^'steci^ the appropriate amino add ude 
chams of a protein into the grooves of doubk helicaK DNA wheie 
they can interaa directly Tiidi base paiis. 

composed i»*imarily of anuip acid residues that do not ooake direct 
contact widn the DNA. Ratiier, they fonn diree-dimcnsional'''scaf- 


Rg. 1. Hdkal whed analysis of a 
carixn]?l<eraaiiul portion of OEBP. 
The amino add soqncaoe of a portion 
of the C/EB? polypaode {11) is 
displayed cnd-tchcna cuaiwn the axis 
of a sdiemadc a The most 
anuoo-tcrauzial residue indodod in 
the analysts is leucine residne 315 of 
C/EBP, and is placed at posttion 
number one <tf dx idealized hdk. 
The second position on the helical 
wheel, which is occupied by ihieo- 
nine 316, is located at an azinutth 
di/^ la c ed by lOCf from position 
number one. The hdkal wheel con- 
sisjs oi seven ''spokes**, correspond- . 
ing to me fit or seven amino add residues into every two ee-hdical turns. A 
total of 28 anuno adds of OEBP are inchttled in die ana^ 
<arfoo3^-temiinal residue being ghituixine 342. The fflm y? add abbitvia- 
ti w we as folkws: A, alanine; Q cysteinq D, aqj^ 
aad; F, phenyialanioc; G, gtydnc; H, histidinc; I» isoleadnc; K, ^sine; L, 
leucine; M, methiomne; N, asparagmc; P, proline; Q, ghittminc; R, 
aiguuoe; S, serine; T, thieoz^ V, valine; W, nyptc^ian; Y, tytosiac. 
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Ffg. 2 (iGft). CbaservatioD of leudne repeals between c-Myc, n-Myc, i-Myc, 
Jun, Pos, and GCN4. Hie amino add scqacnoe of apoition of each piottia 
is displayed on a schematic a hdix accoming to the codvcnttons of Fig. I. 
The r^oQ of c-M^ selected for daspby had been observed to share 
substanda! scqncncc similarity with C/EBP (11)^ andstamarkudnc residue 
413 of the rnouse c-Myc p olypeptide (2S). The regiosis of n-Myc and XrMyc 
sckocd for display oonespottd to dK sanK cacbosyi^ani^ 
chosen for c-Myc The analysis of n-Myc staitod at Inirine residie 425^ and 
that ofL-Myc started at koanc residue 333 (29). Asl^htsimilazity m ammo 
add sequence has been observed among c-Myc, Jun, Fos, and the DNA 
trnxling domain of GCM4 {14^. Helicai whcd analyses were earned out on 
these segments scatting at leucme residues 465 of Jun (50)> of Fos (3i)y 
and 25 3 of GCN4 (32) . Amino add residues that share cdamty whh C^P 
(seeF^. 1) are boxed. Ftgw 3 (right). Helical net ana^ of a carboK^- 


teiminai r^ion of OEBP. The hdical region containing the leucine rq^eats 
ofOEBP was flattened inix> two dimensions bysplitdng the hdix lengthwise 
along the &ce opposite to the strip of leucines. display St 
left comer wtui vafine lesidne mmiber 306. The iint helical tmn is 
completed with ghttanune reskhie 309; the seoood cum is initiated by 
gmminc residue 310, and so on until prcrfine residue 344. The split along 
the hdbc bisects two residues, lytme 324. unA g^iraTn^iy- ^ As CTlA, each of 
these tesidoes is diqilayed tn diq>Ucate» corresponding to the '^st" and 
%st^ residues of helical turns 5 and 6 aod 10 and 11, respective^. 
Oppositely cfaatged residues located in portions suitable for ion paxrii^ 
(t ± 3 or t ± 4) are connected by a dashed Ime. 


folds^ diat march the contsour of DNA. These scafiblds dictate the 
appropriate positioning of die interacting protein su^^ 
atocnic interacdon berween amino add $idc chains and the base pairs 
diat ooQsdcutie a spcdfic binding site on DNA. Indeed^ ic is die 
amino add sequences of these scalds, cadier than suifiGe-oomaa- 
ing sequences^ that exhibtc salicnc pxocdt>'to-procein snnilarity (6, 

There arc ampk reasons to anddpaic licw anddifeent structural 
modfi for DNA landing proteins. For example, cryscallograf^ 
studies of the lestricdon endonudease Eco BI shovtr that its 
recognition specifidty is not established by dtfaer of the afotcmen- 
doned stnicniral modfi (P). M<»eovcr, the ainino add sequciices of 
at lease three newly kkndfied sequcnoe-spedfic DNA binding 
proteins have &iled to show relafittfacss to dther die *%ela-tUR>> 
hdhf' modfor the '^dnc finger^ cnodf(i^^ 7^). Finally, amino add 
sequence analyses of nudear onc<^ene products, which have been 
anddpatiDd to modulate gene expression by bindmg DNA ( 72), have 
£ulcd to show sunilarity to dtfacr of the csfahlishcsd DNA binding 
monfi. 

We no^ describe an amino add sequence modf common to five 
D>^ holding pooteins: three nudear tzansfooning proteins, and 
two tcanscriptional r^;ulatory proteins. The motif consists of a 
periodic rq>edtioa of leucine residues. We propose that the leucines 
extend from an unusually long a helix» and that the leudne side 
chains of one helix interdigicate with those of a matrJiing hdbc fiom 
a second polypeptide to form a stable nonoovalent hnkag^. Further^ 
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more, we predia that die paired bdices of diis class of proteins play 
a fundamental role in arranging the contact snr&cc for seqnciKe- 
specific interaction with DNA. 

A periodicity of kudnes. Rat liver nuclei contain a heat-stable 
protein that is capable of binding in a scqueiKe-spedfic manner to 
r^;uIatory DNA sequences common to a number of di^rcnr animal 
virus chromosomes (13), Two of the ds-r^ulatcwy DNA sequences 
to which this protein binds arc the **CCAAT homology^ common to 
many promoters of genes that encode messenger RNA, and the 
^enhancer core hi^nology^ common to many viral enhancers. Since 
it was not initiaily realized that the same protein accounted €or both 
DNA binding acdvides, it was variously termed CBP (CCAAT 
binding protdn) oc EBP (enhancer binding protein). The pl^^sio- 
lo^cal rok of this pnxein is foorly Undo^cood; as such, wc 
contbue to designate the protein merely arrorHing to its binding 
spedfidty (C/EBP). 

Ilie getie that eooxles C/EBP has been isolated as a rooooibuiant 
done, and the DNA binding domain of the protein has been 
localized to a 14*M> segment {1 1), The amino add sequence o£tbc 
C/EBP DNA bmding dtanain contains an abundance <^ residues 
with charged side chains, cspedally lysines and aiginines^ however, 
no prolines occur within dus re^on. Since proline residues arc 
sekkmi found in a heUces^ we ansmged the animo ad^ 
C/EBP on a schematic a hdix. When analyzed in this way, a 28- 
amino-actd segtnent of the DNA binding dcxnain cadubioed notable 
amphipathy (Fig. 1): One side of the hypotfaedcai helix was 
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predominajtcly ooinposed of faydroj^olnc amino adds (pardcakijy 
leudncs), wlille die ocfacr was oomposed of residues with chai)^ 
side chains (six basics and four acidks) and unchaigcd polar side 
chains (ghitaaunes, direonines> and sooncs). 

Most conspicuous was the periodic tepetfdcMi of kudoe xesklues. 
Leodncs appear at cmy seventh position ovor 
adds whfaiQ die DNA binding domain of CTEBP. This distribudco 
is not simply a fimcdon of ibt abundance of this residue since the 
28-aniino-acid vcffoa contains only one odker leudhe. In a 
er-assiittd search for related pcotdn sequences, we found substantial ' 
similarity becwoen OEBP and die pibducfc of the mouse' c^myc 
protD>oncogCDe (11), The two protons share 17 idcntirics within a 
SO-amino-acid tcffoa localized near their respective caxbokyl tcnni* 
nL "pie teuton of ^milarity between G/EBP and the tcansfoming 
£«otein c-Myccoinddes almost pecfocdy widi the pn^posed a heliz 
of C/EBP difiwn in Fig. 1. Having no^ the unusual repetition of 
leucines in C/EBP» we attanged die c-Myc sequenqe on a sdiematic 
a hdiz and discovered the same motif (Fig. 2). the seqkienoe of 
mouse c-Myc; starting 32 residocs ii^istream. fixim its carixso^ 
tetminu^, cadiilnts Icudnes at eveiy seventh .residue over eight 
hypotiictical turns <^ a hd jx. The same periodic array of leucines 
occurs in human c-Myc, a related'hnman protein termed n-MyC, and 
at dbree out of four positions in human L*Myc (Fi^ 2). The tingle 
deviatioB fiom this hepcad pcriodidi/ of IcudQcs was observed m 
human and consisted of a replacement by methionine. 

Amino add sequence siaularities liove been noted betweont c-Myc 
and two other nudear tran^brming ^teins, Fos and Jun (14). 
Moreover, both structural and funaioi»l evidence has indicated that 
Jim is. fenced to GCN4, a DKA binding protein fiom yeast that 
pla^ a direa role in regulating tianscripticm {i4, 15). When 
dispbycd on schematic a hdioes, we nodood that the relaxed 
srqncnccs of eadi of these proteins esdiibit at least four periodic 
repeats of leucine residues (Fig. 2). 

The invariant occurrence of at least four leucine repeats in five 
different protons is not tdlective of general sequence timilaiity. 
AUiough any two of the pKKeins share subscaiitial similar^ 
this intriguing region (that is, C/EBP is quite similar to c-Myc, and 
Jun is cVcn more similar to GC!N4), no single amino add, other than 
the four leucines, is conserved among all four proteins. Indeed, Jun 
arui GCN4 share only one odicr identity with OEBP within tliis 23- 
residue window, and Fos shares only two other Identities (see Fig. 
2). 

Predictioii of umisoal helix staftility. The leucine tcpat com- 
mon tb die aforemonioned protdns was observed as a coD^ 
of projecting arnino add sequences on an idealized a helix. If dicse 
s^meots of protdn actually exist in an a-hcUcal ccai£c»niatioQ, the 
predicted heUccs would be unusually long. Tbt distance fiom the 
£rst leucine to die fourth, 22 amino adds^ would require a 
minimurn of six hdical turns; C/EBP, Jun, and Fos atcuaily contain 
five kudoe repeats, which would span at least ei^ a-hdical turns. 
Two of die primary forces diat stalnlizc a helices are the ainphi- 
padiic arrangement of hydrc^hobic amino adds and die ocdmxnce 
of oppositely charged amino add pairs coD%ured in a manner that 
allows fonnaripn of a sak bridge {16-19), The disposition of 
hydrophobic residues on one side of an a hdix can provide a. 
contiguous array of stabilizing interaoicsis with the gk^nilar fold of 
a proteiiL In more pronounced cases, hydro^^ic^ interactions of 
this namre fedlitate die coiled-ocMl intertwining very long a 
helices found in keratins, lamins, and paran^osin {2G). 

The stahiKring infiuence <^ ion pairs, which depend on the 
appropriate juxtapositioa of addic and basic amino adds widiin an 
a helix (21), has been inferred fiom two Lines of evidence On die 
analytical side, computer searches the Brookhavcn Data Bank of 
solved protein structures have shown that a helices are relatively rich 
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in ton pahs. In a computer study of all idencififid a-hdical structures, 
Sundaralii^am et id, (18) notioed a proportional increase ip die 
fivqucnqr ci ion p^ as the size of the hdix lengthened. The dass 
including the largicst a helices^ whidi conasted of at least six heOcal 
turns, contained an average of 0.4 ion pair per tum.'Ih a separate 
study, Sundaralingam also examined the indikiioc of ton pairs in a 
eatery of a helices common to calmodulin ahd txoponia C (22). 
These unusual a he li c e s , whidi connect dual ^bular doniains 
witfiin each protfiin^ are exceptionally long, stable, and solven^ 
exposed; moreover, duy are extremely rich in intiahelical ton pairs, . 
oontaiiiing an avera^ density of 0.7 kn pair per heUcal tuciL 
apetintental side, short peptides with systematically varied amino 
adid sequences have been synthesized and tested for dicir propensity 
to adopt sohrent^stabk a hefical stzucturc. Using such an approach, 
Marqusee ^nd Baldwin {19) found diat the ^ropnare juxtapoti- 
tion of oppositdy diargcd reddues fostered hdix stability. 

As noted above, C/EBP exhibits an anq)hipathic array of hydro- 
phobic residues in the area diat shares sequence timilaiity widi c- 
Myc (see Figs, land 2). Mmover, this same rcgba of ^ . 
unusnaUy r^ in oppositdy diarged residues tliat are juxt^^os^ 
manner suitaUe for ion painng. An a-hdical di^ilay of the re^rm 
containing the five lendnc repeats of OEBP (Fig. 3) indicatts that 
eight ion pairs occur within die d^ helical tums tliat separare the 
fiTiSt kudrie fiom the fifih kiiridne. Two additional pairs occur i^ 
helicd. turns that precede the kaidne dosest to the amino tennin^ 

The ctenshy of icm pairs observed in C/EBP, roughly one per 
helical turn and cvcnty spaced thror^bout the 30-amino-acki r^^ 
of interest, is exceptionally high. Only helices known to be lot^ 
stable, arid solvent^j^osed banre ion p^ detisiiies appcoachin^ 
per torn. On this basis, wc predict tbut a substantial portion dT the 
DNA Ufiding domain pf CflBBP, whidi indudes the leucine repeat 
motif^ exists in a solvent-stable, a-helical coofomiation. IndBed, 
since die seqnenoe-spedfic DNA Innding activity of CVEHP is 
unuimally hea^stabk (li), we surest that tlie PNA bindi^ 
may occur widiin or adjacent to this hdix. 

Imeilidscal interdigititikMi of leucines: The z^per mbddL 
Why do eadi of the nuclear protdns diat we have ocmsidered 
ocMitain kudnc residues at every scvcndi position over at least six 
hdkal turns? The most obvious response to thtsqqestion is that die 
leudncs. establish amph^thy, ^^Ahich hdpis to ^^^m a kmg a 
hcHx. The ''helical wheeP plots of Figs. 1 and 2 show that C/EBP, 
GCN4, and Jun ccHirain hydrophobic residues at three out of four 
po^it2(»is on the ^'spc^ iknrnediately adjacent to that a 
four Icudnes. This arrangement estabiiishes a continuous ''spine? of 
bydrophc^dty over six to dght helical turns. The opposite &cc of 
eadi putative hdjx is rich in aniino adds with ctdier charged side 
chains or undiaiged polar side chains. Acoording.to our cc»iven- 
tiods, ''spokes^ S and 6 arc <^^posite fiom the ''^ke^ containing 
ieudnc residues. Out of the 40 residues tliat occupy ^^^kes" 3 and 
6 in the five prbtdns (C/EBP, c*Myc, FoSj Jun, and GCK4), 21 bear 
charg!»i side diains and 11 bear itndiarged pdar tide diains. 

Although amph^atiiy is a common feature of each of these 
putative a hdiocs, tiie d^;ree of sur&ce hydrophobidiy abng the 
noq>olar side is not rrm^^rfcably high^ Two of die proteins have 
almost no hydrophotnc amino adds at '^poke" positions to 
(he leucines; Fos has tasfy a single isoleudne to abet hydrdpfaobidcy, 
and c-Myc has onfy a single leircine (both are kx^ted at position 3 of 
"spokc^ 5; see Fig. 2). In these cases, surfikce hydrophobidty is 
limited to a very diin **ridgc* consisting of <me leucine reddue every 
odicr turn of the helix. Furthemwrc, if hydrt^hobidty were the 
only proporty ixocssary along the nonpolar side of dicse helices, it is 
perpkxingthac leucines would be used to the virtual cxchision of all 
Gtbar hydrophobic amino adds/ 

We have xacd a computer ptogram to generate a molecular modd 
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of C/EBP in the reg^QQ including its leucmc repeats (25). The side 
chains of C/EBP prpgrammed oaco the a carbon bickbohcof 
an tdfaliTwi a hx£x and displayed m diree dimensions , Tbe graphics 
program h^hiigfacs van dcr Waals radu in color* We chose to display 
die periodically r^eadng leudnc residues of C^BP in blue and aU 
remaining amino adcis 14 red. Figure 4 sho^ the putative OEBP 
helix as \icwed.fito(n dirae difietent pcr^)ecdve8 around die hong 
axis of the helix, cadi difiering by 90**. 

Ihre&dimcosional projections of OEBP sljow that the van dcr 
Waals xadii of the repeating kudoe side chains are insufficiem^ 
dose to provide sfahilirine imrahclicai interactioos. Ibis observar 
tioo retnferced the need to finda hydrophobic sut&oe to match the 
thin *^Ddg^ of hydrophobidty. Sudia smEacc could adst within the 
globular 'fold of the same polypeptide. Atocmaiivdy, as in the case of 
coiled*^ a helices^ the a>mplemematy.suEto 
by a separate polypepdde. Our attifinion became fbcused on the 
latter possibility several xtasons. (i) The lendneiiiodf maintains 
stria adherenjK to a hcptad repeat. In no case was t^ 
die repeat sluftod by even a single amino add. The hepcadr^eat is 
the qiqotessential feature shared by all proteins that adopt a cdled- 
coil structure (20). (ii)Thehy<hx>plK)bicsurfeoeoftfaclcudnerqpcat 
motif seemed unusually thin ' if the heUx were cp exist widiin a 
gbbtdar fold, the prc^drcics of diat foJd wot^ 
date the unusual aspeds of the hdix (the tia^ 
phobic *V^dge," ah abundance of charged amino adds, aiid an 
abundance uncharge polar amino adds). 

ppwe assume diat die "ridgc^ of hydrophobidty of each helix 
requiics a matching sur&oe, and consider that this snt&ce mi g h t be 

dcMaied by a separate pplypqiridg, pi>rhaps the he«<- canJiHaty w rfi^ 

matching sur&ce domoed by the same hdxx from a second mono- 
mer. That is, the hydrophobic sur&oes of two leucine hdioes might 
interact to form a: dimer. Ahboi^ based on the aukd-coil parar 
d^m, tiiis faypodieticai stricture is distinctiye 
the inten^tion surfioe is relativdy short Keratios, lamiii^ 
prottios that adopt a ccMlcd-coil structure rely on dozens or even 
hundreds of hqptad repeats, resulting in extensive^ inficnvovch 
helices (20). Moreover, the snterhelical hydrophobic intetactions of 
those pixneins can be established by almost any hydroj^ic^c 
add. The motif that we propose covers only four or five hepead 
repeats, and depends almost cxdiisiv^ on leucine side chains. 

Wh^ is leudne used to the virtual cn^ion <€aU other hydropho- 
bic residues? An examination of the stcucturai propeities.of hydro- 
{^lobic anuno adds may be instructive in this tq;suxL 
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Hie leucine side chain iias two methyl groups extending from its 
singk 7 carbon, arid no mediyl giqqis appended to its ^ carbon. As 
sudi, its side chain is lon^ symmetrical, and bulky at die .dp. We 
predict tliat these properties aOow die kuctne residues from one a 
helix to intecdigitaee with those of a second a helix^ fonoiiiig a 
molecular dpper between two polypeptides (Fig. SA). It is impor- 
tant to note that an ideal a helix contains slig^itly more than 3.5 
residues per turn. Therefore, if our hypothesis is correct, die paiied 
hehces must be distorted in order Co maintain maximum paddiig of 
the leudne side.diains. 
Our model of ordered interdigitation may aoooum for the cxdu- 
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sive use of leucine. like the tooth of a zipper, the leucine side diain 
is rdadvely long and buUy at its tip. l^Ieudnc also contains an 
extended side diain tiiat is appended with two metix]^ groups^ 
However, otic of the mediyl gixnips is attadicd to the 3 carbon of 
the isoleudne side chain. We propose that tbc prcjectkm of a mediyl 

groi^ fit>m a ^ carbon would interfere sterically with ti» sequential 
intctdigitation of hydrophobic side chains. According to these 
intetpretadons, valine would ocmstmne a particularly ill-suited 
amino add. Its side diain is short (relative to leudne) and is 
af^cnded with two methyl groups extending fiom die p carbon 
atom. Rather than acting to lode two hdioes tc^ethcr, valine wtMild 
block intcrdigitatkm. Given these oonsidcradons, methionine ml^ 
be expected to constitute die most suitable ahemative to leucine. 
Like leucine, its p carbon is fiec of attached methyl groups. 
Moreover,'it confafas both a methyl ffoup and a bully sulfur atom 
at the dp of its side chain. Our of 31 heptad repeats analyzed in our 
stnd^, we observed 30 kudncs and <Me mcAionine (sec.Fig. 2). 

The modd that we have presented makes the explicit prediction 
that.tfae "leudhc zipper* would represent the dimciizaiiQn domain 
of this class of DNA binding proteins. Aldiou^ this [xcdictioo has 
iiot been tested <Urecdy, k is known that GCN4 and 
sc^ndon as stable dimeis (24), Moreover, the domains cf both 
proteins cntical for dimenzadon map to r^zons coinddecict widi 
DNA binding, which indudes the penodic leudne r^)eat5. The 
posability that a stable diirorizati^ 

an a helix of less than ten turns has been democstratpd in model 
systems whh syntfacdc peptides (25). 

If our *1eudne zippo^ model is correct, then the a helices must 
interlod^ in one of two mentations, paraHel or and*parallel idadve 
tp the ammo-tD-carbaxyl dipok of eadi hdk. OystallogEaphk 
studies have shown that helix packing can occur in ddierttkntatkm 
(17). Howeyqr, all precedents for coifed^xnl interaction between 
two polypq>tides adopt a paralld oonfotmatipn (20), Despite this 

wc sevexal reas(»s to anddpte d^ 
tnodf exists in an andparallel oonfinxnation. ]^rst, die int%mat;^ > 
decree of side chain paddng implidt to our modd mig|it oiily be 
compatibk widi antiparalkl hetioes. Amino add skic chains are 
disposed at an an^ pointing toward the amino terminus of the 
hela (Fig. 4); as ^ch, an antqparalki arrangement might be better 
suioed for skie diain inteffoddng. Second, an andpaiaBd oonfoimar 
don would alkywthe d^ok of oiie ^dix to attnux, radier th^ 
the d^k of the mstxMng helix. 

Wc atmdpate that the issue of helical cdentadon wiU be cs^ 
the {node by whichifaese proteins bind DNA. Either ooofbn 
paraiki or antqparalldl, results in ^ rpcadooafly symmeocic m^tfrwk' 
(Fig. 5B). In an annparaQd oonfotmation, the axis of rotadon 
wouU be peipendicuibr to the paired Vl«pcyj, whereas in a paraUd 
confocmatkm, the axis would be in line with the hdioes. The 
rekvance of helix orientation is undersoorod by the &a diat the 
"'kudoe ^ppei^ aicKic is not su£kieiit to coxifo seqacooe^spedfic 
interaction with DNA* Anuno add sequence analysis oTOEBP has 
shown that tlie protein a;»itains a higjh prc^pmioo of bask residues 
iri a 3p-aniino-add r^on immediate^ ad|acBnt to its ^leudne 
2jppec^ (11). Hiis highly bask r^bn of OEBP cxhibhs substantial 
sequence similarity to a region of the Fos-transfomiing |«otein diat 
is juxtaposed idccodcal^ to its "^dnc zipper^ (11). Moreover, the 
OEBP baMc region must remain intact in order for OEBP to bind 
DNA (2d). We pcedkt that die "leudi^ apper' juxtaposes tlx 
rr^bns of two polypeptides in a manner suitahk for sequence^ 
^odBc recognition cSfDNA (Fig. 6). 

By comparing the amino add sequences of several DNA binding 
proteins we have discovered a repeating motif of kudne rcskiues. 
We prc^iose that ditsc leucines pio^ fiom oosz^iarativdy long, 
spable^ sohrenc-exppsed a hdices, and that the kudne residues that 
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sJtti ^' Three-dimensional model of the hdical leucine repeat region of 

• IbP- The amino acid side chains of C/EBF between leucine 315 and 
^^ •^c 343 were appended onro the a carbon baddsone of an idealized a 
'P^'Jix and displayed in three dimenstonf; by means of a computer ^aphics 
:omjpgfam (23). Leucine residues that occur at every seveodi residue are 
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5. Schematic diagram showing hy|K)thetical intcrdigitanon of leucine 
ion Ifc chains between tw o helices- (A) Two parallel tubes represent die 
ccnlproximatc dimensions of the a carbon batkbonc of idealized a helices, 
j^j- icrdiptating protrusions symbolize leucine side chains. The sphere located 
the tip of each residue lejircsciits the two methyl groups attached to the 7 
"jbon atom of the leucine side chain. (B) OEBP dimers disjH^sed in cither 
l^Ti antiparalkl (left) or parallel array (right) with respect to the leucine 
bcWt- Thick arrows withirt helical cylmdcrs denote amino-to-carboxyi 
arc fol*^- Thin arrows denote axis of rotational synrnieiry. 
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Fig. 6. Disposition of 
the leucine repeat and 
DNA binding domain 
within ihc intact 
OEBP polypeptide. 
(A) Tx>cation of die 
leudnc repeat within 
the intact OEBP poly- 


fpddc. The region necessary for sequence-specific interaction with DNA 
inf - JCoids bcycmd the leucine repeat towmid the amino-tcmunus of die protein, 
icS*.^ includrs a 30-amino-acid rc^on that is highly positivdy cliarged 
iig,:fipplcd). (B) A hypothetical modd of a OEBP dimcr established by the 
{)ar ^^^gl^tation of leucine repeat helices in an antiparalkl confonnadon. 
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displayed in bhie, all other residues arc displayed in red. Tlie amino-to- 
carbocyl dipoic cdFthe helix is arranged left to right. Each succcssh^e view of 
the hdix is rotated by 90" around an axis parallel to the helix. Note diat the 
distance between leucine residues exceeds die van dcr Waals radii of the side 
chain R groups and that side chains tik toward die amino terminus of the 
hdix. 

project from one a hdix intcrdigieate with those of a second bclb^ 
causii^ the two molecules to dimcrizc Wc believe that this nK)df, 
dac **lcucinc zipper," represents a part of the sca&ld that molds a 
protein to interact with its target site on DNA. Finally, we point out 
the possibility that the "leucine zipper^ might allow dimcrization of 
different polypeptides so long as each subunit contained the motif. 
Indeed, recent evidence has raised the possibility that the Fos and 
Jun transforming proteins, which we have shown to contain the 
leucine repeat motit form a heterotypic complex (27). The nodon of 
heterotypic intcracdons raises potentially important implicadons 
relating to the combinan^rial action of gene re^atory proteins, and 
may fecihtate a more direct attack on the funcdon of nuclear 
oncogenes. 
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Fig. 4. Thiee-dinKDsbQal xoodd of the hdkaS leucine repeat regkm of 
OEBP. The amino add side cfaaizs o£ OEBP between ieudne 315 and 
Icuonc 343 iiim appended ODCO ifac a caiboo bac^^ 
helix and di^>layod in tfatce dixnenskms bf means cf a ooo^uter graphics 
'pxogiam (23). Leoctne lealdnes diat occur at every seventh residae aie 


di^hyed in blue, aU other cesiducs are di^ktyed in red. The 

caxtxnyl dipok of the he2iz is axranged kft to £^ 

the hetix is romed by 9(r aroaixl an axis paialkt to the hdii. N 

distance bermn kucine rcsidxies cgreeeds the van 

diain K groups and that side chains tsk toward the amino teoaunns of the 

helix. 



B 




Fig. 5, Schanadc diagiam showing hypothetical tntcrdigitation of kudne 
Side duios between two a hchocs. (A) Twq paralH tubes nsresem the 
approodmate d imfen si o ns of the a camn baitbone of idealtzod a hdkes. 
Interdighadi^ protriKions qmobolize 1^ 


at the tip of dbm residiK rqntsems the two methyl gr^ 
carbon atom of the kudiK side chain. (B) C/EBP dimcxs disposed in either 
an antiparalld (left) or paralld array (right) widi respect to the leucine 
repeat. Thkk arrows wnhin helical cylitKkrs deanote anmioio-carboxyi 
dipole. Thin arrows denote axis of rota&nal symos^^ 
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F^ 6. Disposition of 
the kudnc rc|>eat ami 
DNA bandiiig domain 
within the imaa 
G/EBP polypeptide. 
(A) Locatton of the 
leucine repeat within 
the intatt C/EBP poly* 

p^tide. The r^lon necessary for seqnence-qxdfic ineeiaciioa widk DNA 
exteiub beyond the Leucine repeat toward the amixio^ 
and indudcs a 30-amino-add region that is highly posttivdy charged 
^stippled). (B) A i^podiedcal model of a CVEBP dimcr established by the 
hneidigitation of Jcuooe repeat helices in an antiparalld oonibKmation. 
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piojea fiom one a hdix intmligiLtate with 
causing the two molecules to dunerize. We belkve that this ina6£, 
the 'teudiie zipper," represans a pair of the scaflbld that molds a 
ptotein to interaa with its target site oa DNA. Fnialty, poiin^ 
the possibility that the "Icucme z^pci* iiug^ alk)w 
differect po^pepddcs so loi^ as eadi subunit contained die mod£ 
Tnric cd, recent evidence has raised the possibility that the Fos atid 
Jun transfomung [xotcins, which we have shown to oontain the 
Eeudne repeat taooE, form a heterotypic oomiribc (27). Thenotkmof 
hcterotyjnc mceraccions raises potential^ iapottant implications 
telatizig to the comhtnatorial acdonof gene rcguJatDiy pra£e]ns> and 
may fadlitate a more direct atcadi^ on the function of nuclear 
Qncogene& 
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